DOCOBBHT BESOHB 



BD 090 236 



95 



SP 007 962 



AUTHOP 
TITLE 

mSTITOTIOH 

SPONS AGENCY 
POB DATE 
GRANT 
NOTE 

EDBS PRICE 
DESCRIPTORS 



Hilkins, Nilliaa E. 

Teacher Expectations and Classrocs Behaviors. 
State Dniv. of Nev York, Brockport. Coll. at 
Brockport* 

Office of Education (DHEN) , Washington, D.C. 
[72] 

OEG-IB-050 
33p« 

HF-$0.75 HC^$1.85 PLOS POSTAGE 
^Acadenic Achievement; ^(Educaticnal Research; 
^'Student Ability; Student laproveaent; ^'Student 
Motivation; Student Teacher Relationship; ^Teacher 
Attitudes; Teacher Behavior 



ABSTRACT 

To test the effect of teacher expectation on pupil 
achieveient, subjects in 2U classrooms, grades 1-6, a) completed the 
Hetropolitan Analysis of Learning Potential and the Metropolitan 
Achievement Test; b) completed a questionnaire regarding perceptions 
of teachers* differential expectations and treatment of students; and 
c) were ranked by their teachers according to their expected 
achievement growth during the year* Observers made regulars visits to 
the classrooms to measure pupil-teacher interactions* The five 
experimental hypotheses tested were a) experimental subjects vill 
perform better on the ability posttest than the control subjects; b) 
experimental subjects vill score higher on the achievement posttest 
than the control subjects; c) expericental subjects in the middle 
ability range vill shov greater improvement than the control subjects 
in the same ability range; d) experimental subjects vill receive 
significantly different instruction than the control subjects as 
measured by independent observers; and e) pupils vill report 
significantly different treatment of control and experimental 
subjects* Analysis of the collected data shovs no significant 
difference betveen the control and experimental groups on ability and 
achievement posttests and suggests rejection of hypotheses "a,*^ "b,^ 
**c," and e*** Although there vas a difference in the instruction 
received by different ability groups, there vas no difference betveen 
that received by control and experimental subjects iu the same 
ability grouping, and hypothesis "d" vas rejected as untestable* 
(HHD) 
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Introduction : 

The publication of Pypriallon In the Clnasroora (Rosenthal and Jacob-* 
8on» 1968) has led to a flurry of roscarcli and discuanion on teacher ex*- 
pcctation and their effects on the academic and social behaviors of 
students. Critics of the Rosenthal research on experimenter bias and 
teacher expectations have sufjf^ested that the flndinf^^s reported by Rosen- 
thal and his co--\;orkers nay be generally untrustxTorthy • Conceptual dif- 
ficulties and inethodol61oRical errors of the kinds sup^ested by Barber 
and Silver (1968), Thomdike (1963) and Elashoff and Snot^ (1971), 
among others. Indicate .that the xdde publication and use of the Rosenthal 
and Jacobson findings is not justified without further documentation. 

At least ten other attempts to raise IQ scores throuf;h exnectancy 
effects have been reported (Conn, et al. , 1960; Evans and Rosenthal » 1969; 
Clairborn, 1969; Jose and Cody, 1971; Fleminp, and Anttonen, 1971; Raster, 
1971; Goldsmith and Fry, 1970; Anderson and Rosenthal, 1968; and Pel- 
legrini and Hicks, 1972). In no case did the findings support the Oak 
School Experiment. 

On the other hand, t!ie research usinp, achievement gains as dependent 
variables sugf^ests that teacher expectancies probably have some effect 
on student achievement under certain naturally occurring conditions or 
under strong experimental manipulation. R±st*s findings (1970) sug- 
gest that teacher exnectancy affects achievement insofar as the teacher 
refuses to interact with and teach certain students because of culturally 
established expectations. Itelchenbaum, Brnzers, and Roas (1969) dem- 
onstrated the self-fulfilling effect using fourteen female adolescent 
juvenile offenders. In this experiment, objective and subjective exams 



prepared by the teachers yielded the dependent variables. Schrank (1963» 
1970) reported D/o successful expectancy studies uslnR colloBe freshmen^ 
final test scores on a mathematics test served as the criterion measures 
In both experiments. Palardy (1969) demonstrated that teachers* ex-* 
pectatlons concerning the probability that boys and f^lrls v;lll learn to 
read was related to the readlnje; achievement test scores of their stu-* 
dents. Other studies have shown that expectations relate to the number 
and quality of teacher-*pupll Interaction (e.f». Rothbart» et al.t 1971; 
Rubovlts and.Haehr, 1970; Good, 1970; and Brophy and Good, 1970). 

In sum, the literature sug(>ests that teaclier expectations may have 
an effect on some of the behaviors and nerfornances of their students. 
Iloi^ever, the same literature susgests many obvious contradictions which 
warrant further study. The present study tras dcnlgned to Investigate 
the relationships between teacher expectations and student IQ, student 
achievement, and the number and quality of pupll-taacher Interactions. 
Three general problems >7ere considered In f^eneratlng testable hypotheses: 
1) Does experimental manipulation of teachers' expectations lead to 
Increases In pupils' ability and achievement scores?; 2) If any score 
Increases do occur, are thsy a function of the self-fulflllng prophecy 
or simply the result of different Instructional pron;rams?; and, 3) If 
any Increases In scores do occur, and If they are the result of the sel^- 
f)ilfllllng prophecy, can we Identify some of the processes Involved? 

Ample evidence has been presented to suf^gest that the results re*- 
ported by Rosenthal and Jacob son (1968) nay well be spurious. It there-* 
fore seems clear that a method olof;lcally sound replication Is warranted. 
Since a major portion of this experiment and the entire Rosenthal version 



of teacher expectancy hln<^e8 first cn Increases In criterion measures as 
a result of experimental manipulation of expectancy, clear and unimpeach- 
able Increases in achievement and ab/.llty ocores must be demonstrated. 
A second problem relates to the possibility that the teacher may en^a^e 
in different modes of Instruction for these nuplls who have been specifl-* 
cally classified. That is, If teachers expect a particular pupil to do 
especially well, he may place him in an instructional group which has 
better pupils, better instructional materials, and possibly different modes 
of instruction. Pupils a;ho had not been marked as deserving special con- 
sideration may not be given this attention. 

In short, the gains in IQ and achievement xohich Rosenthal; and 
Jacobson interpreted as expectancy effect caused by some subtle communis 
cation system may well have been the result of placin:^ the experimental 
students Into one instructional f^roup rather than into another. This 
rationale suggests that, if there is a subtle communication system in- 
volved In the self-fulfllllnr; prophecy, it may not be as important as the 
original instructional placement of tha students by the teacher. The 
interpretation posited here may help explain why teachers could recall 
or recognize the names of only a few of the "bloomers** in the Oak School 
Experiment (Rosenthal and Jacobson, 1963). 

Furthermore, Rosenthal and Jacobson report th^t experimental Ss in 
the middle range of ability made greater gains in reading achievement than 
either the lew ability group or the high ability group. Assuming that 
original instructional placement ia ncre important than the subtle com- 
munication system, this finding suggests that tancliers rtwo-re icadiiy be- 
lieve that they could mistakenly place an average pupil in the xnrong group 



but not a very comi^etent pupil or a very Incoropetent pupil. 

The third problem to be investir^ated is under the rubric of the 
process of the self -fulfilling prophecy and concerns the pupils* and the 
teachers* perceptions of expectancy. Tliat is, are the expectancies con- 
scious on the part of those involved? Specifically, (1) do the teachers 
have the expectations during and at the end of the treatment period; 
(2) do the matched pupils differ from their controls in their perceptions 
of v;hat the teachers expect for then and for their matched counterparts; 
and (3) do the other pupils In the class recor^.nize a different communica- 
tion system between the teacher and the experimental subjects and the 
teacher and the matched control subjects? In other words, pupils must 
perceive that the teacher expects different achievement and behavior 
among them before a pupil can make a response contingent on the teacliers* 
expectations. Therefore, teachers* expectations must be revealed in some 
form so that pupils can perceive differential treatment. 

A review of previous research did not yield information directly con-- 
cemed with pupils* abilities to perceive teacher expectations and dif- 
ferential treatment. Hovrever, indirect support can be taken from Flan- 
agan and Havumaki (1960) who demonstrated that tenth <;rade pupils praised 
more often by the experimenter x^ere chosen more frequently on a soclogram. 
Beck (1964) reported the development of a pupil perception of teacher 
behavior questionnaire, thereby demonstrating that Intermediate level 
elementary school children could, and do, perceive at least eleven spe- 
cific teacher characteristics. Lippitt and Gold (1959), and Fox, Lip- 
pit t and Schmuck (1964) found that elementary school children perceived 
seventeen specific characteristics of their peers and that students uti- 
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llzed these perceptions in conceptualizing the power structure of the 
classroom. Furthermore » these perceptions remained consistent through- 
out the school year. 

In sum, It appears that elementary school children make accurate 
perceptions of subtle affective and cof;nltlve behaviors of their teachers 
and their peers. Furthermore, It seems reasonable that pupils can per- 
ceive differential expectations of the, teacher and that these perceptions 
may be a vital link In whatever communication system may be Involved In 
the self -fulfilling prophecy process. 

Within the framework of these general problems, five specific 
hypotheses were tested: 

1. Experimental Ss will shm significantly better performance on 
the ability post test than the matched control Ss. 

2. Experimental Ss will shor; significantly better performance 
on the achieveme t posttest than the matched control Ss, 

3. VJhen compared with their matched control subjects, experimental 
Ss in the middle ranges of ability xd.ll make significantly 
greater improvement in achievement and ability than either the 
high ability experiemntal Ss or the loi7 ability experimental Ss. 

4. Experimental Ss will receive significantly different instruction 
f.rom that of the matched control Ss as measured by group class- 
ification and ratings by independent observers. 

5. Pupils will report, on a questionnaire measuring pupils* per- 
ceptions of teacher behavior, significantly different pupil- 
teacher Interactions between experimental Ss and the teacher as 
compared with the interactions of matched control Ss and the teacher. 

ERIC 
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Procedure : 

1 . Selection of Subjects and Experimental Design 
All students In twenty-four classrooms » (>rades l-*6 (four class- 
rooms at each grade level) were administered the Metropolitan Analysis 

1 

of Learning Potential^ and the Metropolitan Achievement Test. Students 
v;ere rank orderdd according; to their achievement test scores and their 
ability test scores. Classroom teachers were asked to rank their students 
according; to how much achievement f^outh they expected from them during 
the year. Three matched pairs were then selected from each classroom^ 
one pair to represent each third of the distributions. That is, 
one experimental pair was selected if each member has an achievement score 
in the top third of the achievement distribution » if each had a pot-* 
ential score in the top third of the ability distribution » and if each 
had been ranlced by his teacher in the top third of her achievement groiTth 
prediction distribution » etc. One further quailf Icatlon for sampling 
was that the pairs were matched by sex. Furthermore » there was an equal 
number of males and females at each ?>rade level » but not for each class 
in the sample. After the matched pairs had been selected » one member of 
each pair x^as randomly assigned to le experimental group with the other 
member becomihg part of the control group. Selection of Ss and imple- 
mentation of she study was completed within the first four weeks of the 
fall term. Thus» there TTere twelve experimental Ss and twelve control 
Ss at each of the six grade levels » or a total of seventy-two experimental 
subjects matched with seven ty-'tx^o*" control subjects* Classrooms were sel- 
ected from two school districts on the basis cf size and geographical 
location. Prior to the experiment^ a questionnaire to determine pupils' 
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perceptions of teachers ' differentifd. expectations and treatment of 
students had been devised and tested for feasibility in classrooms 
not to be used in the study. The instrument vas administered 
to all students in the twenty-four participating classrooms at the 
beginning of the school year. Trained observers made regular visits 
to selected classrooms to use the observation scale devised by Rist 
(1970) as an added measure of pupil-teacher interactions. 

It has been hypothesised that dustructioniSLL groupings of 
students may be more important than the communication system bet- 
ween the students and teacher. Tlierefore, a documentation of the 
grouping of experimental and ccu:.v.:.l subjects was made. 

f 

2. Inducing Teacher Expectancy 

It was noted earlier that each classroom teacher was asked to 
rank his students according; to how much he expected them to grow 
in achiervement . After the experimental subjects were selected , 
the investigator had a conference with each teacher for the sup- 
posed purpose of discussing the differences between his ranking and 
the test scores. Because of the wide spread popularization of the 
Rosenthal findings , teachers were told that further reaearch into 
the problems of achievement testing was being done. Using this as 
a ploy, the investigator pointed out to the teacher, during the con- 
ference, that the scores obtained on the two tests indicated that 
he had seriously underestimated the achievement potential of sev- 
eral of his pupils. Further, he vas informed that we would like 
his cooperation in trying to validate our test scores by classroom 



observatibn; the investigator then informed him that three pupils 
in his classroooiy as veil as three pupils in each of several other 
classrooms 9 had been singled out for class study • He was then 
given the names of the three experimental subjects and an expectancy 
inducement. That is, it \r*.8 pointed out that while he may have 
ranked subject one ob likely to grow, our tests indicated that this 
student should grow or improve much more rapidly than even he had 
first anticipated. He was then told that subjects two and three rep- 
resenting the middle third and the bottom third of his ranking, 
ought to improve much more than he expected. In short, he was strongly 
induced to believe that these three students were going to make 
exceptional gains during the course of the year. 

3. Observation and Post-Testinfy 

Taachers were informed that observers \70uld make regular visits 
to his classroom to see if the responses and school work which the 
experimental Ss produced in the classroom were eonsistent vitJi res- 
ponses th&t they made on the tests. It was also anitlcpated that 
the regular visits of these observees would serve as a reminder to 
the teacher and therby reinforce the expectancy. 

During the entire school year, one of three observers made 
weekly observations of approximately forty-five minutes each in each 
of the twenty-four classrooms. Observers used the nine item teacher- 
pupil interaction scale developed by Rlst (1970). Observers also 
noted changes in grouping, and other informal observations about 
the general atmosphere and operation of the classroom. 



At the end of the achool year, the aeventy-two e3q)erimental Sa 
and the aeventy-two control Sa were re-odminlatered the Metro- 
politan Achievement Teat and the Metropolitan Analyaia of Learn- 
inc Potential, The pupil perception of teacher behavior queation- 
naire, developed by the inveatigator, waa re-adminiatered to all 
of the children in all twenty-four claaarooma. After all poat- 
teating was completed, the teachera were informed of the actual 
purpoae of the experiment. 

Reaulta; 

The effecta of the experimental raariipulation of teacher expec- 
tationa waa tested on eight dependent variablea. Theae criterion 
meaaurea covered a variety of academic and aocial factora. The 
mean acorea and atandard deviationa for each meaaure are preaented 
in the following tablea. Note that theae deacriptive atatiatica 
are coiaputed by ability level within grade level for the control Sa 
and the experimental is tor. eftoheof;:the eight dependent variablea. 

— INSERT TABLES 1 to 9 ABOUT HERE 

Careful inapection of these Tablea auggesta that aignificant dif- 
ferences between experimental Ss and control Sa are not likely ex- 
cept on the Obaerved Interaction variable. 

Differencea on all variables itrore teated for aignificance 
by analysia of variance. Of particular concern for each dependent 
variable is the comparison betwenn experimental and control aub- 
Jecta and interactiona involving the treatment factor, flatched 
paira within grade-ability level-treatment (df=72) aerved aa the 
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error term for all comparisons. Since exact matching of achieve- 
ment test scores was not always possible because of other matching 
criteria 9 the pretest scores on the Metropolitan Achievement Test 
and the pretest scores on the Analysis of Learning Potential 
Test were treated as cover lates In all anULyses. Tvo separate 
analyses were done. The first encompassed grades 1-4 Inclusive 
and the second encompassed grades 3-6 Inclusive. Of the avail- 
able computer program opt Ions » It ;/3o concluded that retalnlug 
the two covarlates was mere valuable than analyzing all six 
grades simultaneously without controlling for pretest differences. 
Therefore 9 achievement gains served as the first dependent 
variable and ability gains served ao the second criterion 
measure with the pretest scores on each measure being treated as 
a covarlate. 

The third dependent variable tested was termed positive 
Interaction. These data were derived from observations of pupil- 
teacher Interactions, The observation scheme used was developed 
and reported by Rlst (1970). The clearness of the category para- 
meters , together with pre-observatlou dlscusslcns, led to consis- 
tent recording of Interactions. Three kinds of positive Inter- 
actions-Verbal » non-verbal^ and physical— were recorded and 
combined Into a positive Interaction score for each control sub- 
ject and each experimental subjectf 

Tlie fourth dependent variable » neutral Interaction » was 
derived In the same manner as the third except that Interactions 
defined as neutral were recorded. Variable number flve^ negative 
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Interactlon^ parallels variables three and four except that 
negative interactions » including verbal » non-verbal, and physical 
were recorded. The sixth dependent variable was the cjxnbined 
interaction score for each ^s. All interactions » from all three 
modes of Interaction (physical, non-vorbal, and verbal), were 
added into a single score. 

The final two variables, sociometric-pre and sociometric- 
post, were used in testing Hypothesis 5. The sociometric instru- 
ment used to generate this data was developed by the writer. 
The results of a factor analysis (principal component method) of 
the twenty-three item scale yielded a principal component com- 
prised of thirteen items. The scores used in the data analysis 
are the sum of the number of times c^ach control subject and each 
experimental subject was chosen by anyone in his classroom on the 
thirteen Items which clustered on the principal component. 

As suggested by the tables of means and standard deviations 
(presented above) the analysis of variance resulted in a small 
nuQiber of significant F ratios. In fact, of the 92 F ratios com- 
puted, only 12 reached acceptable levels of significance « Of the 
12 significant F-ratios, three, regarding over-all achievement 
gains for the entire sample were relatively uninteresting. These 
differences, on Variables 1 and 2 Indicate more variance betvjeen 
the pretest scores and the posttest scores than within group 
variance. This difference was found for Grades 1-4 on both the 
achievement measure and the ability measure (F « 21.36, £ < .001 
and F. - 18.45, £ < .001 with 3, 10 df respectively). For the 

o 

ERIC 



-12- 



Gradea 3-6 analyais^ the difference was found only for ability 
scores (F - 6.03, £ < .013 with 3 10 df). These findings probably 
suggest the obvious; namely, that all ^s made significant acheive- 
ment gains over the course of a year. 

Analysis of data by ability level within grade level yielded 
only one significant F-ratio. This F Indicated a difference in 
the number of pupil-teacher interactions for experimental and 
control groups combined for grades 1-4 (F^ » 3.22, £ < .020, 6, 
22 df). Inspection of the mean scores presented previously re- 
veals that, for grade 1 S^s, hish ability students had more inter- 
actions than did their low ability ne^irs. This finding is lu 
agreement with other findings reported In the literature (e.g.. 
Good and Brophy, 1970). For grade 2 Ss, findingo indicated that 
average ability students and low ability students had significantly 
more interactions with their teachers than did high ability 
students* In grade 3, low ability students interacted more with 
theli: teachers than either averase ability students or high ability 
students. Tliese findings are incongruent with others previously 
reported. One possible explanation involves differences In in- 
structional methods used with different ability groups. That Is, 
very often teachers would assign high ability students indepen- 
dent projects and then leave then to their own initiative while 
she spent more time in more directed activities with average and 
low ability students. An lnspec*:lon of scores for grade 4 ^s 
revealed no significant dlf ferencoa in the number of pupil- 
teacher interactions for the various ability levels. 
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When the data were analyzed by ability level without regard to grade 
level, two significant F-ratios were found. Doth of these differences 
involved the observed interactions between pupils and teachers. For 
Grades 1-U, differences were foiind in the total number of pupil-teewher 
interactions (F = U.75» p. less than .019, 2, 22 df). Inspection of raw 
scores indicates that low ability students had the most interactions with 
their teachers ^ followed by the averai^e ability students, with high ab- 
ility students, having the fewest interactions with their teachers. 

For Grades 3-6, significant differences involved the 3>ositive inter- 
action variable (F = U.U8, p. less than .023, 2 22 df). Further einalysis 
revealed that. t>w ability students had the largest number of positive 
interactions with their teachers, average ability students had the smal- 
lest number of 'positive interactions with their teacher; and high ability 
students had more positive interactions with their teacher than did the 
average ability student but fewer than the loir ability students. 

The explanation for these findings is co3GiJect'ara. The independent 
study hypothesis offered above migiht also apply to these results. Another 
possible alternative is that the absolute nmber of interactions is an 
artifact of the observatinn time, \lhile matched pairs were observed for 
an equal amount of time, it is conceivable that observation time across 
girade level and within ability group, but without regard to the matched 
pair criterion, may not have been equal. Thus, low ability students may 
have been involved with the teacher for a larger share of the observation 
time than either of the two groups. This possibility is partiularly 
appropriate to the lower grades where a large amount of instructional 



time is spent in small (proup instmiction. 

Testing for the treatment effect, experimental manipulation of teacher 
expectancy, between experimental Ss and control Ss revealed no signi- 
ficant F-ratios. The failure to find any differences Involving the treat- 
ment factor on any of the various academic and social variables suggests 
rejection of Hypothesis 1, hypothesis 2, and Hypothesis These results 
would be consistent with the majority of studies discussed previously. 

When treatment effects were tested without regard to grade level, 
five significant F-ratios were obtained. These positive findings also 
inovlved the observed pupil -teacher interactioh . Inspection of the raw 
data indicated that experimental subjects had more positive and neutral 
interactions with their teachers than the control subjects in Grades 1-U 
when grade level and ability level were not considered (F = 5.30, p. less 
than .Okk; F = 13.28, p. less than .005, 1, 10 df respectively). An 
examination of the raw scores relevant to Grades 3-6 suggests that ex- 
perimental Ss had more total interactions (F = 9.07^ P^ less than .013, 
1, 10 df) as well as more neutral interactions (F = 6.75, p. less than 
.028, 1, 10 df) and more positive interactions (F = 5.66, p. less than 
.039, 1» 10 df) with their teachers when compared with the control Ss. 
litoile these findings seem to support the general thesis of teacher expec- 
tancy effects, an accurate interpretation of these findings is unlikely. 

Since the design utilized in this experiment T/as specifically devel- 
oped to test id.thin grade level differences, diffeuences across grade levels 
cannot be accounted for. It is possible that the iifferenccs across 
grade levels is a function of between grade variance rather than between 
experimental Ss and control Ss variance. In any event, these findings 
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are at best weakly supportive of Hypothesis 5* 

\ftien treatment effects were tested within grade level and by ability, 
no significant differences were obtained between the matched pairs. The 
failure to find interaction effects nzles out the possibility of ac- 
cepting any specific hypothesfes. Tlae lack of significant interaction 
effects for the treatment factor, combined with the failure to find any 
significant grade level-treatment factor, combined with the failure to 
find any significant grade level-treatment interaction effects forces 
a rejection of all liypotl.eses except number k. Since Hypothesis k 
concerns differences in instructional methods rather than differences on 
any of the eight dependent variables, it has yet to be considered* Be- 
fore considering data relevant to this hypothesis, some closure on the 
analysis of variance is necessary. A single significant F-ratio was ob- 
tained when the treatment effects for different ability levels was 
tested without regard for grade level. Again, the observed pupil- 
teacher interaction variable is involved (F « 9.^1, p. less than .001, 
2, 22df). Analysis of raw scores indicate that low ability Ss had the 
largest number of positive interactions with their teachers, followed by 
the average ability groups, tod then the high ability group for Grades 
1-1* • No corresponding F ratio was found in the Grades 3-6 analysis. While 
this two-factor interaction effect is more suggestive than the main 
effects ra;?* ted previously, it is not amenable to specific interpreta- 
tions. At best, it suggests that students across four grade levels have 
more positive interactions with their teachers if they are singled out 
for special treatment. This interpretation must be tempered by the fact 
that grade level differences were not accounted for; therefore, a con- 
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clusive statement cannot "be made. 

Since no significant achievement gains or ability gains were found. 
Hypotheses regarding olfrecences in instructional methods for exper- 
ijnian-bofcX Ss when compcured to control Ss, had no criterion measures on 
which to be evaluated. However, caref\il documentation of within class- 
room instructional methods and ability grouping revealed no significant 
differences between the teaching of eacperimental Ss and the teaching of 
control Ss. While differences between ability groups was noted (eg., more 
independent study for "brighter" students,), there was no indication 
that matched pairs within a given ability lever were treated differently. 
High ability control subjects participated in as xoany independent projects 
as did their experimental counterparts. 

It should be stressed that these findings do not rule out the pos- 
sibility that achievement differences reported by other investigators 
may have been the result of differences in instructional methods rather 
than experimental manipulation of teacher expectancies. The failure to 
find achievement differences in the present stxidy makes it impossible 
to rule out the instructional methods hypothesis. 

In sum, data analysis revealed that only 12 of 92 F tests compited 
reached acceptable levels of significance. Three, regarding over-all 
achievement gains, wejre relatively tininteresting. Of the remaining 9» 
8 were generally uninterpretable main effects involving observed pupil- 
teacher interactions. One ability level-treatment interaction effect 
was reported. This effect was not amenable to concise Interpretation. 
Thus, all experimental hypotheses were rejected, or, in the case of Hypo- 
thesis untestable. 
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Summary__and Conclusions ; 

The failure tc find any significant effects of experimentetl man- 
ipulation of teacher e3q)ectahions leads this investigator to concur 
with others in regarding the Rosenthal and Jacobson findings spurious . 
In a relatively well executed randomized block design, the failure to find 
interaction effects supporting the specific hypotheses for any of eight 
variables leads to the concliision that teachers' expectations, and sub- 
sequent behavior^ which may affect achievement, eannot be. altered by the 
simple process of showing teachers cognitively inconsistent test scores. 
That is, while teachers' expectations may well affect the social and 
achievement behaviors of students, the etiologiceJ- factors are varied 
and complex. One inconsistent test score cannot dispel other social 
and academic factors which iiapinge upon the teacher and serve as the 
basis for expectations. 

Although some investigators have reported gains in achievement test 
scores as a result of expectancy inducement, the majority of negative find- 
ings suggest that significant positive findigns are either situation- 
specific or chance findings. 

Since no significant differences in instructionea methods for groups 
within classrooms was observed, except presenting the same material at 
a slower pace, hypot> es concerning modes of instruction cannot be 
evaluated. Purt since grouping and regrouping for the control 

Ss and the experimental Ss was at a minimuiu in all classrooms, no basis 
for the assessment of the group placement hypothesis is available. Since 
no significant results in criterion measures were found, the effects of 
group placement and instructional methods could not be determined. 
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It could be argued, however, that the lack of regrouping was one of the 
chief reasons that there were no increases in the achievement posttest • 
That is, if group placement is a major cause in making the teacher's ex- 
pectance come true, the failure of the teacher to regro\ap experimental 
subjects could have contributed to the lack of significant differences 
in achievement meastires* 

Based on the findings reported in this study, the other findings 
reported in ^post Rosenthal and Jacobson studies, and an analysis of the- 
oretical positions regarding the self-fulfilling prophecy presented else- 
where (Wilkins, 197^*), it is the opinion of xhis wuiter that alternative 
modes of investigation be pursued. It seems clear that a teacher's 
expectations are founded on a variety of complex and interrelated factors 5 
and it is naive to assume that shovring a teacher one or two scores on a 
standardized achievement test will have any effect on his original 
expectations. It is recommended that observational studies, following 
Rist (1970) and historical studies, after Seaver (l97l) be undertaken in 
an effort to identify the various social, psychological, and academic factors 
utilized 'by teachers when they set expectations. Wiile this writer 
still holds the original claim that teacher expectations affect pupil 
behavior and achievement, atteD5)ts to experimentally manipulate or alter 
those expectations are premature until the basis for and the operations 
of teachers' expectancies is ascertained* Candidly .j, it seems unlikely 
that we can experimentally manipulate teachers' expectations if we do 
not know what factors or antecedents to manipulate* 

Another avenue of investigation which should be p^irsued was suggested 
liy Finn (l972). Quite correctly, Finn notes that the teacher is only one 



(perhaps minor) source of expectancy for the Student. Parental expec- 
tations,, -peer expectations, Bni self -expectations are also impinging on 
the pupil's behavior and achievement* A wide variety of empirical ques- 
tions are readily obvious. lb one source more potent than the others? 
What is the effect of incongruent expectancies from different sources? 
Does a change in the expectations of one source influence the child's 
perceptions of expectancy from other sources? Indeed, it is possible 
that the predominance of other sources of expectancy makes the teacher's 
expectations rather inconsequent ieuL. 

In conclusion, it seems clear that the expectations of others play 
an influential role in the behavior of the individual, both in school 
and in the larger socieQ. arena. Furthermore i» the baisJiS for the expec- 
tations and the individ\XEj.s ' perception of the various expectations are 
complex and generally virgin areas for investigation. Finally, it seems 
•fair to conclude that investigations following the Rosenthal and Jacobson 
model have demonstrated the model to be naive, and founded on inadequate 
evidence. It is hoped that investigations following the ideas outlined 
above will be undertedcen, and that these investigations irf.ll provide 
an understanding of a potential source of significant educationaJL 
change and improvement. 



The first grade sub J act c found the Metropolitan Achievement Test too 
.diffictilt and frustrating • Therefore, first grade Ss have scores only 
the Analysis of Learning Potential. 
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